University  of  Alberta 
Printing  Department 


Digitized  by  the  Internet  Archive 
in  2019  with  funding  from 
University  of  Alberta  Libraries 


https://archive.org/details/Hochachka1962 


THE UNIVERSITY OF ALBERTA




RELATIONSHIP BETWEEN EXTINCTION RESPONDING AND CONDITIONING REINFORCEMENTS


By 


NADIA HOCHACHKA


A  THESIS 

SUBMITTED  TO  THE  FACULTY  OF  GRADUATE  STUDIES 
IN PARTIAL FULFILMENT OF THE REQUIREMENTS FOR THE DEGREE

OF MASTER OF SCIENCE


DEPARTMENT  OF  PSYCHOLOGY 


EDMONTON,  ALBERTA 


AUGUST  20,  1962 


Abstract 


The purpose of this investigation was to ascertain to what
extent "resistance to extinction" is a function of conditioning
reinforcements when spurious responses (accidental bar-presses due to
activity) are not recorded. If there should occur a positive
correlation between "input" (conditioning reinforcements) and "output"
(extinction responses), further analysis of extinction data was
planned in order to establish: (1) when during extinction maximum
contingency between "input" and "output" occurs, and (2) how long this
contingency continues.

Pretraining (approximately 50 pellets) was given to 120
male rats so that each subject earned one reward by pressing an
inconspicuous lever. Each subject was then trained until 0, 4, 12,
36, 88, or 108 reinforced bar-presses had occurred. Extinction for
either 2 or 4 hours occurred on the following day.

Since the operant response level (response level of control
subjects) was virtually zero, the object of recording only responses
due to learning was apparently achieved. Number of extinction
responses did reveal a reliable increase as number of conditioning
reinforcements increased from zero to 4 to 12. Additional
reinforcements (from 12 to 108) did not reliably increase the number of
extinction responses. The obtained differentiation between groups
occurred primarily during the first 6 minutes of extinction, as
indicated by measures of number of responses per minute.

The data emphasize the importance, to assessing the effects
of number of reinforcements, of measures very early in extinction.
They question the usually assumed relation between persistence of a
habit and number of conditioning reinforcements. Since there occurred
during extinction a temporary burst of responding in proportion to
number of reinforcements, except for the maximum-reinforcement group,
an attempt was made to relate these findings to the constructs of
inhibition and frustration. An apparent reduction in resistance to
extinction in the highest-reinforcement group should be investigated
by an extension of the reinforcement parameter considerably beyond 108.
If extinction responding should be found to relate to number of
reinforcements in a non-monotonic fashion, it might be necessary to
re-examine the practice of using resistance to extinction as a
measure of habit strength.




Acknowledgements


Throughout this study, Drs. Miles and Uhl have given me
many valuable hours, providing direction and criticism, and constant
support. I am warmly appreciative of them.

I  am  grateful  also  to  Mr.  Ritchie  of  the  Department  of 
Extension  for  providing  and  splicing  film  strips  for  our  timer,  and 
to  Walter  Corfield  for  his  active  concern  with  my  apparatus  problems. 

A constant participant in the experiments was Gary Dean,
who patiently tamed most of my animals. Thank you, Gary. Thanks
also to all of my fellow students, whose discussions have been both
helpful and enjoyable.




Table  of  Contents 


Abstract

Acknowledgements

Table of Contents

List of Tables and Figures

Introduction

    General and Theoretical Considerations

    Purpose

Method

    Apparatus

    Subjects

    Procedure

Results

Discussion

Summary

References


List  of  Tables  and  Figures 


Tables

1.  Summary of trend analysis of group response
    rates during two hours of extinction


Figures

1.  Mean response rates (responses/time) during
    two hours each of extinction and spontaneous
    recovery; comparisons of means by Duncan's
    New Multiple Range Test -- means underscored
    by the same line do not differ significantly
    from those not so underscored

2.  Mean response rates (responses/time) within
    recorded extinction intervals as a function
    of time; comparisons of means by Duncan's
    New Multiple Range Test -- means falling
    within the same enclosure do not differ
    significantly from each other but do differ
    from means outside that enclosure

3.  Mean cumulative number of responses during
    various intervals of extinction as a function
    of number of reinforcements


Introduction 


General  and  Theoretical  Considerations 

A technique which is frequently used for investigating simple
learning processes is referred to as operant conditioning. An organism
responds to, or "operates on", its environment, and those responses which
procure reward, or reinforcement, become strengthened, i.e. response
frequency or probability increases. For example, a white rat, when
placed in a box having a depressible bar attached to one of its walls,
will most likely investigate it and eventually press it. If the rat is
rewarded immediately with a piece of food following its bar-press
response, it is more likely that the bar will be pressed again, with
reinforcing effects accumulating until a persistent response tendency
develops. In other words, the rat acquires a bar-pressing habit. Once
reinforcement no longer follows the bar-presses, the strength of this
response, or the frequency with which it occurs, diminishes. This
process of emitting a previously rewarded response without reinforcement
is termed extinction, and is a common measure of the strength of a habit.

Habit strength is generally thought to increase monotonically
as the number of reinforced acquisition trials increases. While this
notion is implicit in the statements of many investigators (Williams,
1938; Perin, 1942; Campbell, 1959; Harris and Nygaard, 1961), it is
stated explicitly by Hull (1943) who conceives of it as a negative
growth function of the number of reinforcements. Because habit is an
intervening variable (Hull considers it a neurological process), it
cannot be measured directly, but must be inferred from antecedent
operations and resulting behavior changes. One of the most common ways
of expressing it is in terms of the progressive decrease in responding
during extinction, measured variously as changes in response latency,
and time or number of responses to an extinction criterion. In Hull's
system, habit strength interacts with a number of other factors to
determine the strength of the subject's tendency to perform the learned
response, i.e. reaction potential, which presumably is reflected in
observable behavior, and which has been represented as follows:

$$ {}_{S}\bar{E}_{R} = D \times V \times K \times J \times {}_{S}H_{R} - I_{R} - {}_{S}I_{R} $$

Each of these factors is in turn a function of one or more antecedent
variables. D, drive, is a function of period of deprivation; V, stimulus
intensity dynamism, is a function of stimulus intensity; K, incentive
motivation factor, is a function of w, or magnitude of reinforcement; J,
delay in reinforcement, is a function of t, or time of delay;
${}_{S}H_{R}$, habit strength, is a function of N, or number of
reinforcements; $I_{R}$, reactive inhibition, is a function of work or
number of responses made; ${}_{S}I_{R}$, conditioned inhibition, is a
habit of "not responding" which depends upon its reduction of the need
for rest ($I_{R}$) which increases with the number of non-reinforced
trials.
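The way the factors combine in the expression above can be sketched numerically. The following Python function is only an illustration: the function name, the argument names, and every numeric value are assumptions chosen for the example and are not taken from Hull or from the present study.

```python
def effective_reaction_potential(D, V, K, J, sHR, IR, sIR):
    """Combine the factors as in the expression above.

    The excitatory factors multiply one another; the two inhibitory
    terms (reactive and conditioned inhibition) are subtracted.
    """
    excitatory = D * V * K * J * sHR
    return excitatory - IR - sIR


# Illustrative values only (assumed, not data).
no_inhibition = effective_reaction_potential(
    D=1.0, V=1.0, K=1.0, J=1.0, sHR=0.5, IR=0.0, sIR=0.0
)
with_inhibition = effective_reaction_potential(
    D=1.0, V=1.0, K=1.0, J=1.0, sHR=0.5, IR=0.2, sIR=0.1
)
zero_drive = effective_reaction_potential(
    D=0.0, V=1.0, K=1.0, J=1.0, sHR=0.5, IR=0.2, sIR=0.1
)

print("no inhibition:  ", round(no_inhibition, 3))
print("with inhibition:", round(with_inhibition, 3))
print("zero drive:     ", round(zero_drive, 3))
```

Because the excitatory factors are multiplied, setting any one of them to zero removes the whole excitatory term, whereas the inhibitory terms subtract a fixed amount regardless of habit strength.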

Since our chief concern is with the last three factors, the
remaining ones will be treated as a constant. Consider first
${}_{S}H_{R}$. In determining the presumptive quantitative nature of the
functional relation of habit, or ${}_{S}H_{R}$, to number of
reinforcements, N, Hull has drawn upon empirical data such as those of
Youtz (1938a), Williams (1938), and Perin (1942), who trained large
numbers of rats to bar-press for a food reward. Various groups were
allowed different numbers of such rewarded acquisition trials, then all
subjects were extinguished to a criterion of extinction of 5 minutes
without a response. In Perin's study, response latencies, extinction
time, and number of non-rewarded trials to reach the criterion were
measured. All three of these measures indicated that extinction
responding was stronger for groups which had experienced greater numbers
of reinforced training trials. This relation, however, was apparent only
up to a point, i.e. about 90 reinforcements. As number of reinforced
trials increased, differences in habit strength decreased and approached
an asymptote. In other words, the ratio of non-reinforced responses to
number of previous reinforcements decreased progressively, with this
function forming a curve with negative acceleration. From these
observations, as well as from others involving conditioning of GSR,
nonsense syllables etc., Hull formulated his Law of Habit Formation.
"If reinforcements follow each other at evenly distributed intervals,
everything else constant, the resulting habit will increase in strength
as a positive growth function of the number of trials according to the
equation,

$$ {}_{S}H_{R} = 1 - 10^{-.0305 N}, $$

where N is the total number of reinforcements from Z." (Hull, 1952,
p. 6).
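To make the shape of this function concrete, the sketch below evaluates Hull's equation at the numbers of reinforcements used for the groups in the present study (0, 4, 12, 36, 88, 108). The function name and the choice of evaluation points are assumptions made for illustration; the computed values are predictions of the equation, not data.

```python
def habit_strength(n_reinforcements, rate=0.0305):
    """Hull's (1952) habit equation: sHR = 1 - 10 ** (-rate * N)."""
    return 1.0 - 10.0 ** (-rate * n_reinforcements)


# Reinforcement groups of the present study, used here only as sample points.
GROUPS = [0, 4, 12, 36, 88, 108]

values = [habit_strength(n) for n in GROUPS]
for n, h in zip(GROUPS, values):
    print(f"N = {n:3d}   sHR = {h:.3f}")
```

The printed values rise steeply over the first few reinforcements and then flatten toward the asymptote of 1, which is the negatively accelerated curve described in the text.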

The basic principle of the simple positive growth function
was chosen as an appropriate algebraic expression of habit because it
is known to approximate closely a large number of observable empirical
relationships in many types of biological situations involving growth
and decay. Thus, Hull's conception of habit formation is summarized
generally as follows: "the amount of growth resulting from each unit
of growth opportunity will increase the amount of whatever is growing
by a constant fraction of the growth potentiality as yet unrealized."
(Hull, 1943, p. 114). More specifically, suppose a maximum habit
strength is 100 units and the growth constant in a given reinforcement
situation is 1/10. Then generation of 1/10 x 100 = 10 units of habit
on the first trial leaves 90 units of potential growth. The habit
increment resulting from the second reinforcement must be 1/10 x 90 = 9
units, leaving 81 potential units; on the third trial 1/10 x 81 = 8.1
units are subtracted from the remaining potential. This process can be
repeated as many times as there are successive reinforcements.
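The worked example (maximum of 100 units, growth constant of 1/10) can be reproduced with a short loop. This is a minimal sketch; the function names are hypothetical, and the closed-form expression is the standard algebraic equivalent of repeatedly taking a constant fraction of the unrealized potential.

```python
def habit_growth(maximum=100.0, fraction=0.1, trials=3):
    """Return (trial, increment, habit so far, potential remaining) per trial."""
    remaining = maximum
    rows = []
    for trial in range(1, trials + 1):
        increment = fraction * remaining  # constant fraction of what is left
        remaining -= increment
        rows.append((trial, increment, maximum - remaining, remaining))
    return rows


def habit_closed_form(n, maximum=100.0, fraction=0.1):
    """Habit after n reinforcements: maximum * (1 - (1 - fraction) ** n)."""
    return maximum * (1.0 - (1.0 - fraction) ** n)


rows = habit_growth()
for trial, increment, habit, remaining in rows:
    print(trial, round(increment, 2), round(habit, 2), round(remaining, 2))
```

The three increments come out as 10, 9, and 8.1 units, matching the example in the text, and the accumulated habit after any trial agrees with the closed form.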

Effective reaction potential, or ${}_{S}\bar{E}_{R}$, i.e. the potential
actually available for the evocation of action, is ${}_{S}E_{R}$ minus
the inhibitory potential. The latter is the concept by which Hull
explains experimental extinction. He begins with the hypothesis that,
"Whenever any reaction is evoked in an organism there is left a
condition or state which acts as a primary negative motivation in that
it has an innate capacity to produce a cessation of the activity which
produced the state." (Hull, 1943, p. 278). Hull calls this state
reactive inhibition, $I_{R}$, another logical construct which is
observable through its effect upon response measures of positive
reaction potentials.

According to Hull, every repetition of a response generates
an increment in $I_{R}$ which dissipates with the passing of time.
Because of the negative motivational character of $I_{R}$, anything
which reduces this need to cease action should be reinforcing. Thus
cessation of action becomes conditioned to whatever stimuli may be
present, forming a genuine habit of "not responding", i.e. conditioned
inhibition (${}_{S}I_{R}$), which combines physiologically with any
$I_{R}$ present on a given trial to subtract from ${}_{S}E_{R}$ and to
determine the effective reaction potential ${}_{S}\bar{E}_{R}$. Because
${}_{S}I_{R}$ is a relatively permanent habit, spontaneous recovery of
reaction potential upon dissipation of $I_{R}$ is not complete.

Extinction is described by Spence and by Amsel in terms of
inhibition of the learned instrumental response by a frustration-produced
response which becomes stronger than the instrumental one. Spence's
theory of extinction may be summarized as follows:

1.  Non-reinforcement of a previously reinforced response results
in an emotional ("anger") state or response which is designated by Amsel
as $r_f$, and is assumed to contribute to the general drive level, D, of
the organism.

2.  Occurrence of $r_f$ is assumed to depend upon the prior
development in instrumental reward learning of the expectation of
reward, i.e. the prior development of a fractional anticipatory
consummatory reaction, $r_g$, the strength of which bears a positive
relation to the strength of $r_f$.

3.  Again following Amsel, $r_f$ not only would occur at the end of
the response chain but, like the fractional anticipatory goal reaction,
would become conditioned to stimulus events earlier in the response
chain.

4.  It is assumed that during experimental extinction the
frustration-aroused response $r_f$, through its own response-produced
cues, $s_f$, tends to elicit previously learned or unlearned overt
responses, some of which are incompatible with the learned instrumental
response. With repetition of unreinforced trials, these competing
responses would gradually become more strongly conditioned to the
situation and consequently would compete more and more successfully with
the learned response sequence, producing the typical curve of decrement
in response strength. (Spence, 1960, p. 98).

A picture of habit strength unlike any of the above has been
presented by several recent studies which have failed to show extinction
responding as an increasing monotonic function of increasing numbers of
conditioning reinforcements. Some of these have in fact shown the
extinction function to take the form of an inverted U. Finger (1942a),
comparing number of running responses, in an elevated runway, of
subjects extinguished immediately following 16 or 8 rewarded acquisition
trials, found a more rapid initial increase in latencies, and fewer
total responses, in the 16-group than in the 8-group. These differences
were not observed when he delayed extinction for 24 hours (Finger, 1942b).

A similar runway response was used by Mote (1944) to test a
larger range of the reinforcement parameter. Latency (starting time)
measures at the end of acquisition differentiated only 3-reward
subjects from others who had had either 12, 18, or 24 rewarded
acquisition trials. Extinction curves of steeper slopes with increasing
numbers of rewarded acquisition trials suggested a positive correlation
between rate of increase of response latencies and number of previously
rewarded trials, i.e. more rapid extinction following greater numbers of
reinforcements.

Mote and Finger (1943) also found that 16- and 32-reward
groups extinguished more rapidly than those with 4 or 8 previous
rewards; and observations by Youtz (1938b) over several extinction
sessions confirmed these trends.

North and Stimmel (1960) allowed rats 45, 90, or 135
reinforced acquisition trials in a straight runway. In 60 extinction
trials, the starting and running times of the 90- and 135-reward groups
increased more rapidly than those of the 45-reward group, leading the
authors to postulate that overlearning facilitates extinction,
producing a non-monotonic relation between resistance to extinction and
number of conditioning reinforcements.

These experiments tend to support Mote's suggestion (1944) of
a maximum "habit strength" resulting from a relatively small number of
rewarded acquisition trials. There is some indication that, with a
more extensive sampling of the reinforcement parameter, there is a range
of number of reinforcements which is optimal with respect to strength of
that response tendency, and reinforcement beyond this range again
decreases the probability of that response in extinction.

Some support for this idea stems from the results of Murillo
and Capaldi (1961) with human subjects (undergraduates) tested in the
Wisconsin General Test Apparatus. "Learners" were defined as those
subjects who achieved 7 consecutive correct responses before extinction.
Considering the entire sample, resistance to extinction decreased with
increased training, but this trend seemed to be due to the behavior of
the learners only. The non-learners showed increased resistance to
extinction with increasing reinforcements. The authors suggest that
resistance to extinction appears to increase with increased training
only up to the point where no overlearning has occurred, and that it
decreases with additional training after learning has occurred,
yielding a curvilinear function.

It seems clear that the precise quantitative nature of the
relation between habit strength, measured as resistance to extinction,
and number of reinforced acquisition trials has not been established
satisfactorily. One difficulty is that a subject will press the bar
even when no reinforcement for it has been forthcoming. Schoenfeld,
Antonitis, and Bersh (1950) observed the unconditioned response rate of
both hungry and thirsty white rats in a bar-pressing apparatus. Given
1-hour sessions each day, the animals exhibited a consistent decrease
in mean number of responses and degree of variability. After only 2 or
3 daily sessions, a fairly stable operant level was reached. Within
each experimental hour also, extinction-like trends were apparent. Such
observations tend to suggest that some of the instrumental responses
made during extinction are not strictly due to learning, but are
accidental, i.e. part of an unconditioned operant reserve. Perhaps it
would be feasible to obtain a more unequivocal measure of the effect of
reinforcements per se with an apparatus designed so that fewer
responses would be recorded during the subject's random movements.

Further, most of the studies investigating extinction
responding following varied numbers of reinforcements have used criteria
of extinction, and have not observed the nature of the relation between
strength of extinction responding and number of conditioning
reinforcements over intervals other than those which met some rather
arbitrary extinction criterion. Presumably, whatever relation was found
at that time was considered to be the representative function, without
concern for possible changes over time.

Purpose 

The present study was undertaken to delineate the effects of
various numbers of initial conditioning reinforcements throughout a
lengthy period of extinction. Data from the previously reviewed
research have indicated that subjects experiencing large numbers of
reinforcements, in comparison with groups receiving few, emit a greater
number of extinction responses. Therefore, some kind of contingency
was expected between "input" (number of reinforcements) and "output"
(extinction responses). During extinction, successive samples of
responding were obtained at different time intervals in order to
establish when and for how long contingencies were manifest. It would
appear that this function - the one characterized by the highest
"input-output" contingency - has the most critical implication for
hypothetical theory construction.




Method 


Apparatus 

The equipment consisted of two Skinner boxes (9 x 14 x 12
inches), each with a [illegible]-wide clear plastic bar protruding
approximately [illegible], and a 1-inch wide metal food tray about an
inch below and 2 inches to one side of it. Each box was enclosed within
a chamber which, when closed, eliminated visual stimulation and
dampened extraneous sound. A magnetic counter in the next room
cumulatively counted the number of lever presses.

Subjects 

The experimental animals were 120 male albino rats of the
Sprague-Dawley strain. Forty-eight were experimentally naive; 72 had
been used in an experiment where activity measures were taken. Their
ages ranged from 70 to 120 days.

Procedure 

Animals were kept in groups of 10 to 20 until pretraining,
when they were given separate living cages and assigned to
reinforcement groups by restricted randomization to ensure that each
condition was represented in all replications of the experiment.
Assignment to the 2 Skinner boxes was similarly randomized within each
replication. The same box was used throughout the experiment for a
given subject.

Taming and habituation of naive subjects involved 7 days: 4
days, at 10 minutes per day, of handling and exploration of table tops,
and 3 days on which subjects were placed in a closed rectangular box
(similar to that enclosing each Skinner box) until they had eaten about
20 small food pellets scattered about on the floor. The first four days
of taming were eliminated for subjects which had previously been handled
in the activity experiment.

The day following completion of taming and habituation, each
subject was given 20 small pellets in the Skinner box, one at a time.
The first 10 were delivered without the relatively loud click of the
delivery mechanism, by manual rotation of the pellet dispenser; the
remaining 10 were delivered automatically by the closing of a switch to
activate the pellet dispenser.

Training  by  successive  approximations  began  on  the  following 
day.  Each  rat  was  rewarded  for  responses  which  brought  him  progressively 
closer  to  the  bar,  until  one  bar-press  was  emitted. 

On the day after pretraining, each subject was conditioned
with one of the following numbers of rewarded bar-presses: 0, 4, 12,
36, 88, 108.[1] To hasten bar-pressing, 3 to 6 "free" pellets were given
initially. Where this proved insufficient to initiate pressing within
about 15 minutes, or if conditioning did not occur after an initial
bar-press, the subject was discarded. Control subjects were given 5
"free" pellets, but not allowed to press the bar.

Extinction for 2 or 4 hours (N = 78 for 4 hours) took place
the next day in the Skinner box with the dispenser disconnected so that
bar-pressing no longer produced reward. On the day after extinction,
66 of the subjects were reintroduced to the apparatus for a 2-hour test
of spontaneous recovery. Cumulative numbers of responses during
extinction and spontaneous recovery were recorded at the following
intervals:

1, 3, 6, 9, 18, 27, 40, 60, 80, 100, 120 min. (two series comprised
the 4-hour periods).

[1] With the exception of the first 38 subjects, this conditioning was
carried out with the box closed. t tests indicated no significant
differences in the mean number of responses made by subjects which had
been conditioned with lid up and with it down.

Deprivation time was measured from the end of a 1½-hour
feeding period. It was 16 - 20 hours during pretraining and conditioning,
and 22 hours during extinction and spontaneous recovery.
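The Procedure records cumulative counts at fixed times, while the Results are reported as responses per unit time within each recorded interval. The sketch below shows one way such a conversion could be made, assuming the rate for an interval is the number of responses in it divided by its length in minutes (the thesis does not spell out the computation). The function name and the example counts are hypothetical and are not data from the study.

```python
# Recording times (minutes) for one 2-hour series, as listed in the Procedure.
RECORDING_TIMES_MIN = [1, 3, 6, 9, 18, 27, 40, 60, 80, 100, 120]


def interval_rates(cumulative_counts, times=RECORDING_TIMES_MIN):
    """Convert cumulative counter readings into responses per minute
    within each recorded interval."""
    if len(cumulative_counts) != len(times):
        raise ValueError("need one cumulative count per recording time")
    rates = []
    previous_time, previous_count = 0, 0
    for time, count in zip(times, cumulative_counts):
        rates.append((count - previous_count) / (time - previous_time))
        previous_time, previous_count = time, count
    return rates


# Hypothetical counter readings for a single subject (illustration only).
example_counts = [2, 10, 16, 19, 24, 26, 27, 28, 28, 28, 28]
example_rates = interval_rates(example_counts)
for time, rate in zip(RECORDING_TIMES_MIN, example_rates):
    print(f"up to {time:3d} min: {rate:.2f} responses/min")
```

Dividing by interval length matters here because the intervals are unequal (1 minute at the start, 20 minutes at the end), so raw counts per interval would not be comparable across the session.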




Results 


Since the operant response level was virtually zero, the
object of recording only responses due to learning was apparently
achieved.

The results of a trend analysis, summarized in Table 1, show
that extinction responding was influenced by number of conditioning
reinforcements (p < .005). Fig. 3 shows cumulative number of extinction
responses at different times for each group, and Fig. 1 shows group mean
response rates (responses/time) over the entire 2-hour extinction period.
In general, increasing numbers of conditioning reinforcements resulted
in increasing numbers of responses per minute, the highest response
rate being that of the 88-reinforcement group. The significance of the
difference between groups was tested by means of Duncan's New Multiple
Range Test, illustrated in Fig. 1. Groups which are underscored by the
same line were not significantly different from each other but were
different from all groups which are not so underscored (p < .01). No
differences occurred among the four high-reinforcement groups (12 - 108),
which all differed reliably from the control group and from the
4-reinforcement group, with the exception of the 12-reinforcement group.
The 12- and 4-reinforcement groups did not differ reliably.


Table 1. Summary of trend analysis of group response
rates during two hours of extinction.

Source of variation      Mean Square   Degrees of freedom   F
Reinforcements           143.06        5                    19.23*
Error                    7.44          114
Time                     250.01        10                   73.10*
Reinforcements x time    17.48         50                   5.11*
Error                    3.42          1140

* p < .005
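The entries of Table 1 can be checked for internal consistency. The sketch below assumes that each effect is tested against the error term listed directly beneath it (the first error term for reinforcements, the second for time and for the interaction), and that the analysis covers 6 groups, 120 subjects, and the 11 recording times of one 2-hour series; the variable names are hypothetical.

```python
N_GROUPS = 6       # 0, 4, 12, 36, 88, 108 reinforcements
N_SUBJECTS = 120
N_INTERVALS = 11   # 1, 3, 6, 9, 18, 27, 40, 60, 80, 100, 120 min.

# Degrees of freedom implied by this design.
df = {
    "Reinforcements": N_GROUPS - 1,
    "Error (between)": N_SUBJECTS - N_GROUPS,
    "Time": N_INTERVALS - 1,
    "Reinforcements x time": (N_GROUPS - 1) * (N_INTERVALS - 1),
    "Error (within)": (N_SUBJECTS - N_GROUPS) * (N_INTERVALS - 1),
}

# Mean squares as reported in Table 1.
mean_square = {
    "Reinforcements": 143.06,
    "Error (between)": 7.44,
    "Time": 250.01,
    "Reinforcements x time": 17.48,
    "Error (within)": 3.42,
}

# Each F is the effect mean square divided by its error mean square.
f_ratio = {
    "Reinforcements": mean_square["Reinforcements"] / mean_square["Error (between)"],
    "Time": mean_square["Time"] / mean_square["Error (within)"],
    "Reinforcements x time": mean_square["Reinforcements x time"]
    / mean_square["Error (within)"],
}

for source, value in f_ratio.items():
    print(f"{source:24s} F = {value:.2f}")
```

Under these assumptions the computed degrees of freedom and F ratios agree with the tabled values to two decimal places.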


[Figure 1. Mean of responses/time during extinction (2 hours) as a
function of number of reinforcements.]

[Figure 2. Mean of responses/time within intervals as a function of
time (min.), with a separate symbol for each number of reinforcements.]

[Figure 3. Mean cumulative number of responses.]


In Fig. 1, mean response rates for each group during a 2-hour
test of spontaneous recovery on the day following extinction show that
the 88- and 108-reinforcement groups still exhibited a slightly
stronger response tendency than the others. No statistical treatment
was applied to the spontaneous recovery data because response rates of
most of the subjects in every group were zero so that the group
distributions were markedly skewed positively.

The effect of reinforcements upon extinction responding is
illustrated again in Fig. 2 where mean response rates of each group
are plotted over time. A one-way analysis of variance of responding
in each recorded interval indicates that the reinforcement effect
remained significant throughout the first hour of extinction (p < .005
for the first 27 min. and p < .01 thereafter to 60 min.). However, not
all comparisons of groups adjacent to each other in number of
reinforcements were significant. The results of multiple comparisons, by
Duncan's Test, of mean response rates during each extinction interval
are shown in Fig. 2. Group means which fall within the same enclosure
did not differ significantly from each other (p < .01) but did differ
from all means outside that enclosure.

It is apparent that significant differences between
experimental groups were found early in extinction. While there was no
significant difference between the 4-reinforcement group and the
controls at any time, the mean response rate of the 4-reinforcement
group was consistently above that of the controls. The greatest
differences occurred in the interval between 1 and 3 minutes, when the
4-reinforcement group differed reliably from all other experimental
groups except that with 12 reinforcements. The 12-reinforcement group
in turn differed significantly from the 88-reinforcement group, but not
from any of the other experimental groups. At no other time in
extinction (i.e. only in this 2-minute interval between 1 and 3 min.)
were there any significant differences among the four groups with
12 - 108 reinforcements. Beyond 6 minutes, the 4-reinforcement group
also ceased to differ from any other group. During the rest of
extinction, gradually fewer of the experimental groups differed
reliably from the control group so that all response rates were
similarly low for most of the time beyond 60 minutes.

The decline in response rate during the course of extinction
was reflected in the significant time effect (p < .005). A significant
reinforcement x time interaction indicated that the change in rate of
response over time differed amongst the groups, as seen in the different
slopes of the extinction curves over time. The curve for the
4-reinforcement group reached its peak at 9 min. and was generally
flatter than those of the other experimental groups which reached their
peaks earlier (at 3 min.).

Separate trend analyses were performed, comparing groups two at a
time. The 4-reinforcement group differed reliably from other
reinforcement groups in the shape of its response rate curve over time,
i.e. there was a significant reinforcement x time interaction (p<.005)
between the 4-reinforcement group and each other group. This held true
for the control group too, but none of the other reinforcement group
comparisons produced a significant interaction effect. It seems that
the reinforcement x time interaction arose mainly from the different
slopes of the response rate curves of the controls and the
4-reinforcement group on the one hand, and the other four groups
(12 - 108 reinforcements) on the other.


Discussion 


Examination of the functions of resistance to extinction shows that
the general result of increasing the number of conditioning
reinforcements was an increase in the number of instrumental responses
emitted during extinction. This was supported by a trend analysis of
group response rates over time and by one-way analyses of variance of
response rates in each successive measured interval. However, the
extent of the contingency varied during different intervals. In
general, it was greatest very early in extinction and decreased
gradually with time. Thus, the extinction function observed here
represented a steeper growth with small numbers of reinforcements
(0 - 12) than those of Williams (1938) and Perin (1942), which
exemplified more closely Hull's growth curve.
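The one-way analyses of variance mentioned above can be illustrated by
a minimal sketch of how an F ratio is obtained for a single extinction
interval. The function name and its input layout (one list of response
rates per reinforcement group) are assumptions made for illustration;
they are not taken from the thesis, and no data from the study are
reproduced here.

```python
from typing import Sequence, Tuple


def one_way_anova_f(groups: Sequence[Sequence[float]]) -> Tuple[float, int, int]:
    """Return (F, df_between, df_within) for a one-way analysis of variance.

    Each inner sequence holds the scores (e.g. responses per minute in one
    extinction interval) of the subjects in one group. Hypothetical helper.
    """
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    if k < 2 or any(len(g) == 0 for g in groups) or n_total <= k:
        raise ValueError("need at least two non-empty groups and n_total > k")

    means = [sum(g) / len(g) for g in groups]
    grand_mean = sum(sum(g) for g in groups) / n_total

    # Between-groups sum of squares: group size times squared deviation
    # of the group mean from the grand mean.
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    # Within-groups sum of squares: squared deviations from each group's mean.
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)

    df_between = k - 1
    df_within = n_total - k
    ms_within = ss_within / df_within
    if ms_within == 0:
        raise ValueError("within-group variance is zero; F is undefined")

    f_ratio = (ss_between / df_between) / ms_within
    return f_ratio, df_between, df_within
```

The resulting F would then be referred to tabled critical values for
the given degrees of freedom; a multiple range test such as Duncan's is
a separate, subsequent step and is not sketched here.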

Duncan's New Multiple Range Test (Edwards, 1960), simultaneously
comparing mean response rates, differentiated a greater number of
groups in the interval between 1 and 3 minutes than in any other
interval during which responding was recorded. This was the only
interval in which any reliable differences between pairs of groups were
found among the four highest-reinforcement groups (12 - 108). These
four groups seemed to differ as a whole from the remaining experimental
group (4 reinforcements), which did not differ reliably from the
control group in any single interval. Similarly, reinforcement x time
interaction, indicating different slopes for response rate curves, was
significant in the comparisons of the 4-reinforcement group with each
other group, though not in any inter-group comparisons among those
which had experienced 12 - 108 reinforcements. The reinforcement x time
interactions involving comparisons of the control group with each of
the other groups also all were significant.

Changes, over time, in the characteristics of extinction responding
as a function of number of reinforcements have not been considered in
theoretical discussions of habit strength. More commonly, response
persistence to some criterion of low response strength (e.g. 5
with no response), following continued nonreward, has been taken as an
index of habit strength. Hull (1943) considers that habit strength
increases monotonically as a negative growth function of number of
reinforcements, and that this is reflected in the increasing numbers of
nonrewarded responses which will be emitted before a criterion of
extinction is reached by organisms which have experienced increasing
numbers of reinforced conditioning trials. According to Hull,
reinforcement of a response results in its becoming more strongly
conditioned to available stimuli. Through generalization and
higher-order conditioning, a fractional form of the goal or
consummatory response comes to be elicited by stimuli which antedate
the goal. This fractional anticipatory goal reaction and the stimuli
arising from it, i.e. rg - sg, are assumed to increase in strength
along with the instrumental response, with continued reinforcement.
While the instrumental response seems to develop to an asymptote rather
quickly, it could be, as suggested by North & Stimmel (1960), that the
fractional antedating response reaches its asymptote later (i.e. after
more reinforcements) than does the instrumental response.
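A monotonic negative growth function of the kind attributed to Hull
above can be sketched as an exponential approach to an asymptote. The
exponential form, the asymptote of 100 units and the rate constant of
0.03 are illustrative assumptions only; they are not Hull's fitted
constants and are not estimated from the present data.

```python
import math


def habit_strength(n_reinforcements: float,
                   asymptote: float = 100.0,
                   rate: float = 0.03) -> float:
    """Negatively accelerated growth of habit strength with reinforcements.

    Hypothetical form: asymptote * (1 - exp(-rate * N)). Each additional
    reinforcement adds less than the one before, and the value never
    exceeds the asymptote.
    """
    if n_reinforcements < 0:
        raise ValueError("number of reinforcements cannot be negative")
    return asymptote * (1.0 - math.exp(-rate * n_reinforcements))


if __name__ == "__main__":
    # Reinforcement levels used for the six groups in this study.
    for n in (0, 4, 12, 36, 88, 108):
        print(n, round(habit_strength(n), 1))
```

Under such a function the gain per reinforcement is largest for small
numbers of reinforcements and becomes negligible near the asymptote, so
the function can never decrease as reinforcements increase.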

According to Hull, continuation of responding is inhibited somewhat
by the accumulation, following each instrumental response, of reactive
inhibition, IR, which dissipates over time when responding is
discontinued. The more rapid responding observed in the
high-reinforcement groups early during extinction should have resulted
in a greater accumulation of IR, so that their presumably greater
original sHR soon would have been cancelled out, because of the
subtraction of total inhibition from it, to give effective reaction
potential, i.e. sER = sHR x C¹ - IR - sIR.
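The subtraction described above can be written out as a short
computational sketch. The function and argument names are hypothetical
labels for the terms of the expression (habit strength, the
multiplicative constant C, reactive inhibition and conditioned
inhibition); the units are arbitrary and no values from the study are
implied.

```python
def effective_reaction_potential(habit_strength: float,
                                 multiplier: float,
                                 reactive_inhibition: float,
                                 conditioned_inhibition: float) -> float:
    """Effective reaction potential: sER = sHR x C - IR - sIR.

    habit_strength          sHR, strength of the conditioned habit
    multiplier              C, all factors multiplying with sHR
    reactive_inhibition     IR, accumulates with each response
    conditioned_inhibition  sIR, the learned habit of "not responding"
    """
    return (habit_strength * multiplier
            - reactive_inhibition
            - conditioned_inhibition)
```

In this form a group with greater habit strength but proportionately
greater accumulated inhibition can arrive at the same effective
reaction potential as a group with less of both, which is the
cancelling-out referred to in the text.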

Since IR is conceived as a primary negative state, or a need to
cease responding, its diminution is reinforcing. Thus any responses
which are incompatible with the original conditioned response, and so
prevent its emission, are reinforced by the reduction of IR and become
conditioned to the cues which precede them. These new modes of
behavior, competing with the original conditioned response, represent
the development of a habit of "not responding", sIR. This habit
develops gradually; therefore dissipation of some of the IR increases
the probability that the original response will recur, with the
consequent accumulation of more IR and further strengthening of sIR.
But sHR no longer is being strengthened, so periodically recurring
bursts of responding will be weaker because of interference from sIR,
which competes so successfully with it that the original response
rarely occurs. In the present study, very few instrumental responses
were emitted by the end of extinction, and twenty-four hours later,
almost no spontaneous recovery of the original response tendency
occurred.

¹ A constant encompassing all the factors which multiply with sHR to
determine reaction potential.

In an attempt to describe events in relation to different temporal
stages of extinction, Amsel (1958) has added to the Hullian inhibitory
factors, IR and sIR, an emotional factor which is another source of
reinforcement for behavior incompatible with the instrumental response.
He has suggested that the experience of nonreward, in a situation which
previously had been rewarding, creates a frustration effect (FE), which
results in a temporarily increased drive level. This is manifested in
types of behavior, e.g. biting the bar, exploration and hyperactivity,
commonly considered indicative of frustration.

Since there is evidence (Wagner, 1959) that only after some minimal
number of rewards will nonreward of a response be frustrating,
frustration is conceptualized as being the result of an interaction
between nonreward and a factor which has been developing during
previous rewarded trials, this factor being the fractional anticipatory
goal reaction (rg - sg) already mentioned. The stronger the rg - sg,
based on previous rewards, the greater should be the FE resulting from
nonreward (Amsel & Hancock, 1957; Amsel, Ernhart, & Galbrecht, 1961).
Thus the present experimental groups which had received high numbers
of reinforcements should have exhibited a more pronounced FE, a
supposition which is consistent with their marked intensification in
responding from the first to the third minute of extinction.

Since Amsel conceives of frustration as an aversive motivational
condition, its reduction, like the reduction of IR, can serve as a
reinforcement to any behavior which interferes with the original
response. With repeated approaches to the bar, the animal comes to
anticipate the frustrative, "aversive" nonreward, so that a fractional
anticipatory frustration reaction begins to develop in a manner similar
to the earlier development of rg - sg. Since the stimuli (sf) arising
from the rf are aversive, any behavior which enables the animal to
avoid them is reinforced by frustration-reduction, and becomes
conditioned to instrumental cues which antedate the goal.

But these same cues elicit rg - sg, which leads the animal toward
the goal, so the two antedating conditioned responses are temporarily
in competition. Since the goal response is no longer reinforced, but
new responses are, by the reduction of both frustration and IR, these
avoidance responses eventually predominate over the original
instrumental response. In other words, the original conditioned
response is extinguished through the interference of the habit of not
responding, sIR, which was frequently manifest, in the present
experiment, through subjects' lying in a corner opposite to the bar.

The addition of the emotional factor of frustration seemed useful
in accounting qualitatively for the type of behavior observed
immediately upon withdrawal of reinforcement, as well as for the
temporary increase in vigor of the conditioned response early in
extinction. However, the effects of reinforcement which were apparent
in the present data early in extinction appeared to wear off quite
quickly. Therefore, designation of habit strength in terms of total
number of responses in extinction obscured the fact that continuing
extinction effects occurred at much the same rate regardless of the
number of conditioning reinforcements. It seems that response
persistence in extinction is not the important factor in the
measurement of habit strength. It is not clear that subjects having
experienced large numbers of reinforcements will "resist" extinction
longer than those having experienced relatively few reinforcements.
Rather, temporary differences in number of responses emitted per minute
may differentiate these subjects early in extinction. Therefore,
attempts to describe mathematical functions relating extinction
responding to number of reinforcements probably should be based upon
measures of response rates early in extinction, when the effect of
conditioning reinforcements seems to be greatest.

There remains the question of the nature of the function relating
extinction responding to number of reinforcements. Although the overall
trend was for increases in extinction responding to be associated with
increasing numbers of reinforcements, there was some support for the
notion (Finger, 1942b) that maximum resistance to extinction occurs
following fairly few reinforced trials. (Statistical calculations quite
consistently differentiated the controls and the 4-reinforcement group
from the remaining higher-reinforcement groups, which were generally
non-differentiated statistically.)

Of greater interest is the indication that the function is
non-monotonic. The 88-reinforcement group consistently displayed a
stronger response tendency than the 108-reinforcement group, though the
difference between these two groups failed to reach statistical
significance. This observation paralleled those of a number of recent
studies, none involving operant bar-pressing, which suggested that a
non-monotonic function relates extinction responding to amount of
training (Senko, Champ, & Capaldi, 1961; Murillo & Capaldi, 1961;
North & Stimmel, 1960).

North & Stimmel (1960) have tried to account for their findings of
decreasing resistance to extinction of a runway response, from 45 to 90
and 135 reinforced trials, in terms of Amsel's frustration concepts.
They assumed that the instrumental response reached its maximum
strength within 45 reinforcements but that rg did much later, perhaps
between 90 and 135 reinforcements. Since rf depends upon the strength
of rg, the latter groups, upon withdrawal of reward, should have
experienced more frustration and a stronger rf. Therefore these groups
should have extinguished more rapidly than the 45-reinforcement group.
The authors suggested that a group which may have experienced a very
small number of reinforcements, e.g. 10, also would have extinguished
in relatively few trials because habit strength would have been
relatively small.

The observation that increased training results in decreased
resistance to extinction (Capaldi, 1957; 1958) has been related to the
observation that overtraining also facilitates learning of a new
response in some situations. Capaldi & Stevenson (1957) trained rats on
a black-white discrimination to three successive criteria before
retraining with reversed cues (if black was previously reinforced, it
became the negative stimulus and the white became the positive one, and
vice versa). While all three groups performed similarly at first, the
most highly trained group soon became differentiated from the other two
by a significantly faster decline in errors. The authors hypothesized
that, following many reinforcements, the change in the reinforcement
pattern is greater than for subjects experiencing fewer reinforced
trials. Therefore, extinction of the original response should be
fastest for the most highly trained subjects, allowing for fastest
acquisition of the new response, i.e. reversal of responding to the
opposite cue. It might be that frustration, due to nonreward, could
facilitate such rapid learning of a new response to the extent that it
facilitates rapid extinction, i.e. the reversed response could be
considered just one more response which is incompatible with the
original one, and is strengthened both by primary reinforcement and by
the reduction of frustration arising from nonreward of the original
response.




Reid (1953) also trained albino rats on a black-white
discrimination, and noted that after stimulus reversal, the greatest
number of repetitions of the original response, but also the fastest
learning of the new response to the reversed cues, occurred in the most
highly trained groups. He suggested that in relatively early stages of
learning, an animal simply learns to make the response required to
obtain reinforcement, but with increasing amounts of training, the
animal learns to respond to the whole set of stimuli of which a
specific subset are relevant, in the sense of procuring reinforcement.
His highly-trained animals learned to look at the black or white
stimulus before responding. Those with less training instead responded
in terms of position habits and to irrelevant stimuli, other than the
black and white cards.

More recently, Brookshire, Warren, & Ball (1961) investigated the
influence of overtraining on response reversal within a stimulus
continuum, as well as on transfer between dimensions; specifically,
response learning (left or right turn) in comparison with place
learning (response to black or white). The authors maintained that
neither Reid's nor Capaldi & Stevenson's hypothesis was upheld, even
though empirical observations were the same, i.e. for rats,
overtraining facilitated response reversal. It did not affect
inter-dimensional transfer. If rats learn to make discrimination
responses to a set of stimuli varying within a single dimension, Reid's
hypothesis should predict that inter-dimensional transfer should be
hindered by overtraining. But, according to Capaldi & Stevenson's
position, overtraining should facilitate both inter- and
intra-dimensional transfer, since extinction of the original habit
would still be more rapid following overtraining. Brookshire, Warren, &
Ball suggest that all the observations might best be described in terms
of Lawrence's (1949) hypothesis that with overtraining, cues along a
given stimulus dimension become more "distinctive", i.e. a change in
the perceptual properties of stimuli operates to facilitate future
discriminations along that dimension, but does not influence transfer
to another stimulus dimension.

Responding during extinction, and on transfer, both inter- and
intra-dimensional, following a wide range of amounts of training on
problems of various difficulty, should be further investigated, to
test both the reliability of observed effects of overtraining, and the
several hypotheses which have attempted to account for these effects.




Summary 


The purpose of this investigation was to ascertain to what extent
"resistance to extinction" is a function of conditioning reinforcements
when spurious responses are not recorded. In order to minimize spurious
responses, a very inconspicuous bar was used. After extensive
pretraining, each of 120 male rats was allowed 0, 4, 12, 36, 88, or 108
reinforced bar-presses. Extinction over 2 or 4 hours occurred the
following day. Number of responses per minute differentiated groups
primarily during the first 6 minutes of extinction, though a
significant effect of reinforcements was apparent throughout the first
hour. The results were related to constructs of frustration and
inhibition. The interaction between reinforcements and time (i.e. the
highest contingency between extinction responding and number of
reinforcements was between 1 and 3 minutes and decreased thereafter),
as well as the trend for the 108-reinforcement group to exhibit less
resistance to extinction than the 88-reinforcement group, question the
common use of resistance to extinction as a measure of habit strength.
These results suggest a need to further investigate the nature of the
extinction function over a wide range of reinforcements and times, and
perhaps to revise the usually assumed relation between persistence of
a habit and number of conditioning reinforcements.




References 


Amsel, A. The role of frustrative nonreward in noncontinuous reward
situations. Psychol. Bull., 1958, 55, 2, 102 - 119.

Amsel, A., Ernhart, C. B., & Galbrecht, C. R. Magnitude of frustration
effect and strength of antedating goal factors.
Psychol. Rep., 1961, 8, 183 - 186.

Amsel, A. & Hancock, W. Motivational properties of frustration: III.
Relation of frustration effect to antedating goal factors.
J. exp. Psychol., 1957, 53, 2, 126 - 131.

Brookshire, K. H., Warren, J. M., & Ball, G. G. Reversal and transfer
learning following overtraining in rat and chicken.
J. comp. physiol. Psychol., 1961, 54, 1, 98 - 102.

Campbell, S. L. Resistance to extinction as a function of number of
shock-termination reinforcements.
J. comp. physiol. Psychol., 1959, 52, 6, 754 - 758.

Capaldi, E. J. The effect of different amounts of alternating partial
reinforcement on resistance to extinction.
Amer. J. Psychol., 1957, 70, 451 - 452.

Capaldi, E. J. The effect of different amounts of training on the
resistance to extinction of different patterns of partially
reinforced responses.
J. comp. physiol. Psychol., 1958, 51, 367 - 371.

Capaldi, E. J. & Stevenson, H. W. Response reversal following
different amounts of training.
J. comp. physiol. Psychol., 1957, 50, 2, 195 - 196.

Edwards, A. L. Experimental design in psychological research.
(Rev. ed.) New York: Holt, Rinehart and Winston, 1960.

Finger, F. W. The effect of varying conditions of reinforcement upon
a simple running response.
J. exp. Psychol., 1942, 30, 53 - 68. (a)

Finger, F. W. Retention and subsequent extinction of a simple running
response following varying conditions of reinforcement.
J. exp. Psychol., 1942, 31, 120 - 133. (b)

Harris, P. & Nygaard, F. E. Resistance to extinction and number of
reinforcements. Psychol. Rep., 1961, 8, 233 - 234.

Hull, C. L. Principles of behavior. New York: Appleton-Century-
Crofts, Inc., 1943.

Hull, C. L. A behavior system. New Haven: Yale University Press,
1952.




Lawrence, D. H. Acquired distinctiveness of cues: I. Transfer
between discriminations on the basis of familiarity with
the stimulus. J. exp. Psychol., 1949, 39, 6, 770 - 784.

Mote, F. A. The effect of different amounts of reinforcement upon
the acquisition and extinction of a simple running response.
J. exp. Psychol., 1944, 34, 216 - 226.

Murillo, N. R. & Capaldi, E. J. The role of overlearning trials in
determining resistance to extinction. J. exp. Psychol.,
1961, 61, 4, 345 - 349.

North, A. J. & Stimmel, D. T. Extinction of an instrumental response
following a large number of reinforcements.
Psychol. Rep., 1960, 6, 227 - 234.

Perin, C. T. Behavior potentiality as a joint function of the amount
of training and the degree of hunger at the time of
extinction. J. exp. Psychol., 1942, 30, 2, 93 - 113.

Reid, L. S. The development of noncontinuity behavior through
continuity learning. J. exp. Psychol., 1953, 46, 2, 107 -
112.

Schoenfeld, W. N., Antonitis, J. J., & Bersh, P. J. Unconditioned
response rate of the white rat in a bar-pressing apparatus.
J. comp. physiol. Psychol., 1950, 43, 1, 41 - 48.

Senko, M. G., Champ, R. A., & Capaldi, E. J. Supplementary report:
Resistance to extinction of a verbal response as a function
of the number of acquisition trials.
J. exp. Psychol., 1961, 61, 4, 350 - 351.

Spence, K. W. Behavior theory and learning. Englewood Cliffs, N.J.:
Prentice-Hall, Inc., 1960.

Wagner, A. R. The role of reinforcement and nonreinforcement in an
"apparent frustration effect".
J. exp. Psychol., 1959, 57, 2, 130 - 136.

Williams, S. B. Resistance to extinction as a function of the number
of reinforcements. J. exp. Psychol., 1938, 23, 506 - 522.

Youtz, R. E. P. Reinforcement, extinction, and spontaneous recovery
in a non-Pavlovian reaction.
J. exp. Psychol., 1938, 22, 305 - 318. (a)

Youtz, R. E. P. The change with time of a Thorndikian response in
the rat. J. exp. Psychol., 1938, 23, 128 - 140. (b)




